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Why mobilize data? 


Dmitry Schigel, Laura Russell 


GLOBAL BIODIVERSITY ige¢ Global Biodivers 
A€ GBIF : 


Information Facility 


INFORMATION FACILITY 


¢ International open data infrastructure 

¢ Funded by the governments of the 
participant countries 

¢ Network for free and open access to 
biodiversity data 

¢ 92 participants: 
54 countries and 39 organisations 





htto:/Awww.gbif.org/participation/summary AQ GBIF 
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RD KIPLING | 


“| Keep six honestserving-men 

(They taught me all | Knew); 

Their names are What and Why and When 
And How and Where and Who” 


How the Elephant got his Trunk 
1902 


ECTb Y MeHA LUECTepKa Chyr, 
[1DOBOPHbIX, YHaJIbIx, 

VU Bce, YTO BUY A BOKDYT, 

- Bc@ 3Halo A OT HMX. 


OHU NO 3Haky MOoemMy 
ABNAIOTCA B Hye. 
3OBYT UX: Kak uv Tloyvemy, 
KTo, 4YTo, Korga ul fe. 


A¢ GBIF 


BIODIVERSITY INFORMATION 


Root rot 
Phytophthora cinnamomi 


Banksia serrata woodland 


Banksia jewel beetle 
Cyrioides imperialis Acacia terminalis 
ie Allocasuarina monilifera 
Banksia serrata 
Dillwynia glaberrima 
Epacris impressa 


Leptospermum glaucescens 
Community | Leptospermum scoparium 





1 member 
Pathogen of arene Win | Leucopogon collinus 
stems 


Monotoca glauca 
Philotheca virgata 
Pimelea humilis 








Biology and 
ecology 





Kingdom: Plantae 
Division: Magnoliophyta 
Family: Proteaceae 
Subfamily: Grevilleoideae 
Tribe: Banksieae 
Subtribe: Banksiinae 
Genus: Banksia Lf. 


| Distribution 


Molecular 
biology 





Informatics partnerships - Common standards — Stable infrastructure 





Why share data? 


Why share data? 


21s century = « century of the data » 


Data quantity increases exponentially 


Well curated and standardized, these data 
have the potential to greatly improve our 
Knowledge and capacities 


A GBIF 


Biodiversity Data Use 


Taxonomic research, niche 
modelling/species distribution 
prediction, invasive and alien species, 
habitat degradation, interspecific 
relationships, ... 


But also... 


Conservation biology, water 
management, eco-tourism, science 
history, hunting and fisheries, data 

repatriation,.. 


£ 
After Chapman, 2006 A€ GBIF 


Reasons to share 


Taxonomy 

Biogeographic studies 

Species diversity and populations 
Life histories and phenologies 
Endangered, Migratory and Invasive 
Species 

Impact of Climate Change 

Ecology, Evolution and Genetics 
Environmental Regionalisation 
Conservation Planning 


Natural Resource Management 
Agriculture, Forestry, Fisheries 
and Mining 

Health and Public Safety 
Bioprospecting 

Forensics 

Border Control and Wildlife Trade 
Education and Public Outreach 
Ecotourism and Recreational 
Activities 

Society and Politics 

Human Infrastructure Planning 


gs 
Source: https://docs.google.com/presentation/d/1iY Co3EdqwcD_Nxk5j_IFgGTLFNEzOCub4qyUvxlSMgs/edit#slide=id.g163ca52ab8 1 7 KG BI F 

















GBIF ENABLED 
© SCIENCE TOPICS 
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partnerships 


GLOBAL RELEVANCE 
Biodiversity 


——— wy Indicators 
CEO GROUP ON Partnership 


EARTH OBSERVATIONS 





United Nations 


: v y 
Convention on ( C) Framewark Convention on 


iol 1 Di 11 Climate Change 
Biological Diversity te Chang 
e* cpt on exile 


Data®)\r 
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RED ~ LIFEWATCH 
LIST 


THE IUCN RED LIST 
OF THREATENED SPECIES” 






www.gbif.org  ~=SBIF 


Barriers to data sharing 


Psychological 
and 

cultural 
barriers 


Institutional 
barriers 


Capacity 


barriers 


Practical 
barriers 


Lack of time / planning 





Lack of will 

Perceived loss of control 
Perceived data value 
Perceived data theft 
Privacy concerns 


Lack of authorization 

Lack of policies 

Business models working against data 
Sharing 


Lack of knowledge 
Lack of understanding 


Lack of funding 
Lack of infrastructure 
Lack of human resources 


A GBIF 


Data restriction levels 


1. Refuse to share 


2. Will only share data under specific 
restrictions 


O Embargo: refuse to share until 
they have exhausted the planned 
use of the data 


O Cost: Will only share their data 
for a fee 


3. Agree to share data openly 





ye from University of Sydney https://library.sydney.edu.au/research/data-management/research-data-management.html io GBIF 





SHORT TERM LONG TERM 









perspective 






RESEARCH DISSEMINATION 
PHASE PHASE 
¢ file formats e share with ¢ repository? 
¢* ownership whom? ¢ long-term 
¢ metadata ¢* embargo? manager? 
¢ storage ¢ licensing 
¢ backups * metadata 


Sophie Kay 2013 (Open Science Training Initiative) CC-BY 3.0. 


Why share data? 


https://conservationbytes.com/2018/01/07/to-share-or-not-to-share-is-no-longer-the-question/ 





Corey Bradshaw 


“Even If you have the intention of mining your dataset for more analyses and stacks of new 
manuscripts over the coming years, making it available to the greater research 
community is more likely to make new opportunities rather than stealing them away 
from you.” 


e@ |f you do share, at a minimum you will be cited, but you also might be invited to collaborate 
or co-author 

e Most journals no longer allow you to be a data hoarder 

e Not sharing your data can reduce your opportunities because others don't know what 
you've been doing 


A GBIF 


Photograph by Fabio Teixeira, Picture Alliance via Gett 


Description: The Museum shelters one of the largest exh 


SOUT? AIT Ic 
Endorsed by: 
installations 
Administrative contact 
Technical contact: 


Country or area 


Hosting 


\¢ GBIF © 








PUBLISHER 


Museu Nacional / UFRJ 





SINCE 





6 JANUARY 2014 


i By Bin ome s 


44 CITATIONS 





How to share your data? 


DISCOVERY 


Bibliographic -> 


General -> 


General -> 


Specific, filtered, 
-> 
API access 


Include tables into your publication 
Table 1, Table 2, Supplementary 
materials 


Publish tables on a standalone website 
Instituional, thematic, personal pages 


Archive as-is in a data repository 
Dryad, FigShare, Zenodo etc. 


Format you data to comply with 
international data standards and 
publish through global portals 

NCBI Genbank, GBIF, OBIS 


A GBIF 


What is Data Publishing? 


“Data Publishing” refers to making biodiversity datasets publicly 
accessible and discoverable, in a standardized form, via an access point, 
typically a web address (a URL).” 





Sy m D I ota a Atlas of Living Australia 


ala.org.au 


A GBIF 


What Does IPT Stand For? 


Integrated Publishing Toolkit 


A GBI 


Hosted resources available through this IPT 
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Bo Eh Be 


Name 


Field Museum of Natural History 
(Botany) Seed Plant Collection 
Field Museum of Natural History 
(Botany) Bryophyte Collection 
Field Museum of Natural History 
(Botany) Fungi Collection 

Field Museum of Natural History 
(Botany) Lichen Collection 

Field Museum of Natural History 
(Botany) Ptendophyte Collection 
Field Museum of Natural History 
(Geology) Fossil Invertebrates 
Collection 

Field Museum of Natural History 
(Geology) Paleabotany Collection 
Field Museum of Natural History 


eae ee 
Field Museum of Natural History 
(Zoology) Bird Collection 


Field Museum of Natural History 
(Zoology) Bird Egg Collection 





Fr’ 


Organisation 


Field Museum of 
Natural History 
Field Museum of 
Natural History 
Field Museum of 
Natural History 
Field Museum of 
Natural History 
Field Museum of 
Natural History 


Field Museum of 
Natural History 


Field Museum of 
Natural History 
Field Museum of 
Natural History 


Field Museum of 
Natural History 
Field Museum of 
Natural History 


INTEGRATED PUBLISHING TOOLKIT?" 


free and open access to biodiversity data 


Type 


Occurrence 


Occurrence 


Occurrence 


Occurrence 


Occurrence 


Occurrence 


Occurrence 


Occurrence 


Occurrence 


Occurrence 


Subtype 


Specimen 


Specimen 


Specimen 


Specimen 


Specimen 


Specimen 


Specimen 


Specimen 


Specimen 


Specimen 


email eserves || togin | eens 


£0, foa 


62,149 


22,051 


265,342 


527,634 


20,992 


Last 
modified 


2018-03- 
05 


2018-03- 
05 


2016-03- 
05 


2018-03- 
05 
2018-03- 
05 
2018-01- 
23 


2018-01- 
24 


2016-01- 
23 


2018-02- 
Jf 


2018-02- 
af 


Last Next 
publication publication 
2018-03-05 -- 
2016-12-19 -- 
2018-03-05 -- 
2015-03-05 -- 


2018-03-05 -- 


2017-01-06 -- 


2017-01-27 — 


2017-02-06 -- 


2017-05-04 -- 


2016-02-27 — 


A GBIF 


Data -> IPT -> GBIF 


Getdata Share Tools _ inside GBIF Powe My toon | 


OCCURRENCE DATASET | REGISTERED 10 JULY 2012 


Field Museum of Natural History (Zoology) Insect, Arachnid and 
Myriapod Collection 


Published by Field Museum 








































© Sharon Grant e Crystal Maier 














Ag GBIF INTEGRATED PUBLISHING TOOLKIT!” =~ = (in) eve | 


ridwide Arthropoda (excluding Crustacea) rank fifth in overall size among 


Home worldwide importance for many groups. The collection presently includes 
8.3 million specimens or lots in alcohol or on microscope slides. In 
orted “bulk samples” from traps or leaf-litter extractions. The collection Metadata Last Modified: 5 March 2018 
l | nal 
onal research visitors and borrowers as well as extensive educational use Data Last Changed: 5 March 2018 


Hosted resources available through this IPT apa 


@ Howtocite Ga] 10.15468/0ywtpe 





Filter: 








90% “\ 23% 72% 
Logo Name Organisation Type Subtype Records Last Last Next With taxon match With coordinates With year 
modified publication publication 


Field Museum of Occurrence Specimen 367 2018-03- 2018-03-05 - 
Natural History 05 


in 
J 
































> occurrence.txt - Microsoft Excel 



























































Home Insert Page Layout Formulas Data Review View Field Museum of Occurrence Specimen 62.851 201 8-03- 2018-03-05 ae 
Cut — om, ; See Fx | - 
= ae Calibri “n+ A She) | Bwnptet General . E tl 24 Bad Good a ae Natural History 05 
Paste - . - . se ¢ - . + Conditional Format ‘Neutral Calculation check cell | — Insert Delete F, Sort & Find & _ a 
vt terme eicter | @ FO ~| >| BA) OE | EGE) erence” | S ~ % » | BAR| Cg tmte 7 iaeciae Fiter Select story Field Museum of | Occurrence Specimen 55,370 2018-03- 2018-0305 - 
Clipboard 2 Font A Alignment 2 Number Editing N Hi 05 
P27 “oC fe| PreservedSpecimen v jatural History 
=i z 5 
e 2 = i E z. : : E L ~ “ a x 4| story Field Museum of Occurrence Specimen 70,784 2018-03- 2018-03-05 - 
1 modified language license _rightsHolc accessRig! bibliograr reference institutior collection datasetID institutior collectionCode ownerinstitutionCode basisOfRecord catalogNumber occurrél) - 
2 2017-2-16 en https://cri The Field https://www.fieldmuseum.org FMNH Insects insects-03 FMNH Insects FMNH PreservedSpecimen FMNHINS 0003 040 036 2ction Natural History 05 
3 2017-2-15 en https://er The Field https://www.fieldmuseum.org FMNH Insects insects-03 FMNH Insects FMNH PreservedSpecimen fe-8bf5-b33cb25e76f7 FMNHINS 0002 829 223 
4 2017-2-15'en https://cr The Field https://www.fieldmuseum.orgFMNH Insects _insects-03FMNH _ Insects FMNH PreservedSpec F454d-9022-7e7b7409eb96  FMNHINS 0002829 224 story Field Museum of Occurrence Specimen 399,688 2018-03- 2018-03-05 - 
5 2017-3-24 en https://er The Field https://www.fieldmuseum.org FMNH Insects _insects-O3 FMNH Insects FMNH }-d69e-4b23-945b-04f3e48ddacd4 =FMNHINS 0003 438 496 and Natural Histo 05 
6 2017-2-15 en https://cr: The Field https://www.fieldmuseum.orgFMNH Insects insects-O3FMNH _Insects FMNH 9e9d-e85b-41b5-a972-422704650458  FMNHINS 0002857977 acne 4 
7 2017-2-15en https://cr The Field https://www.fieldmuseum.org FMNH Insects insects-O3 FMNH Insects FMNH 8355e3d0-d78b-4e39-96b7-185fd674223d = FMNHINS 0002858 750 
8 2017-2-16 en https://er The Field https://www.fieldmuseum.org FMNH Insects insects-03 FMNH Insects FMNH 1f6802¢9-949d-435b-8673-9850b61b31e5 FMHD#2016-0002 z s 
9 2017-2-15'en https://enThe Field https://www.fieldmuseum.orgFMNH Insects _insects-03FMNH _Insects FMNH ac32516b-b2d7-4d24-abfb-d59588214568 _ FMNHINS 0002 853.436 story Field Museum of Occurrence Specimen 252,069 2018-03- 2018-03-05 - 
10 2017-2-15 en https://er The Field https://www.fieldmuseum.org FMNH Insects insects-03 FMNH Insects FMNH ervedSpecimen 6d713949-864d-448c-958d-ed20272faS3a FMNHINS 0002 853 437 action Natural History 05 
11 2017-2-15 en https://cr: The Field https://www.fieldmuseum.orgFMNH = Insects = insects-O3FMNH _Insects FMNH PreservedSpecimen 551bf6cd-fa45-4ac5-9d7b-alssb5fs0e6f FMNHINS 0002 995 425 ames 
12 2017-3-24 en https://er The Field https://www.fieldmuseum.orgFMNH Insects insects-O3FMNH _Insects FMNH PreservedSpecimen 36f60a3c-ae3c-4a6e-Sc6e-Sb2624ad6341 FMNHINS 0003 573 796 story Field Museum of Occurrence Spec imen 20,992 2018-02- 2018-02-27 = = 7“ 
13 2017-2-15 en https://cr The Field https://www.fieldmuseum.org FMNH Insects insects-03 FMNH Insects FMNH PreservedSpecimen 8e581860-Oace-4leb-acad-ff0d040517ac FMNHINS 0002 856 778 a U ~ 3 rT © x 
14 2017-2-15'en https://er'The Field https://www.fieldmuseum.orgFMNH Insects _insects-O3FMNH _Insects FMNH PreservedSpecimen 27ab3b7d-b1d2-44ec-bc38-774fcf432026  FMNHINS 0002 856 779 pn Natural History 27 
15 2017-2-15 en https://er The Field https://www.fieldmuseum.org FMNH Insects insects-03 FMNH Insects FMNH PreservedSpecimen 5fff3062-9bb0-4730-b751-8e525736ef7f FMNHINS 0002 821 734 e 3 
16 2017-2-15'en https://crThe Field https://www.fieldmuseum.orgFMNH Insects _insects-O3FMNH _ Insects FMNH PreservedSpecimen 835ae0b9-2e81-4037-83e4-b3f19dcOcaf9_ - FMNHINS 0002 994 146 story Field Museum of Occurrence Specimen 216,325 2018-01- 2018-01-26 = 
17 2017-2-15 en https://er: The Field https://www.fieldmuseum.orgFMNH Insects insects-O3FMNH _ Insects FMNH PreservedSpecimen deidOfff-de6c-46fc-9d74-Sde8d18db2f7 FMNHINS 0002 994 147 on Natural History 26 
18 2017-3-24 en https://cr The Field https://www.fieldmuseum.org FMNH Insects _insects-03 FMNH Insects FMNH PreservedSpecimen 132ab502-bc9d-4547-80d8-d6e941dc5359 FMNHINS 0003 479 584 = Pad 
19 2017-2-15'en https://er: The Field https://www.fieldmuseum.orgFMNH Insects insects-O3FMNH _ Insects FMNH PreservedSpecimen fc52a402-af79-4055-b74c-fd02c849efds FMNHINS 0002 840 632 story Field Museum of Occurrence Spec imen 109,845 2018-01- 2017-11-15 ot 
20 2017-2-15 en https://cr The Field https://www.fieldmuseum.org FMNH Insects insects-03 FMNH Insects FMNH PreservedSpecimen 491¢5246-dfbe-4c5b-blde-6044d3cl86ef FMNHINS 0002 951 105 Nat | Hist 23 
21 2017-2-15 en https://errThe Field https://www.fieldmuseum.orgFMNH Insects insects-03FMNH Insects FMNH PreservedSpecimen 97a62d7-cb13-4fd2-84e0-3d881823a2fa FMNHINS 0002951 106 atural history 
22 2017-5-ST:en https://cr The Field https://www.fieldmuseum.org FMNH Insects insects-03 FMNH Insects FMNH PreservedSpecimen dd579425-040d-463c-b595-3f5994dcd070 FMNHINS 0003 652 489 - : SEE GALLERY 
23 2017-2-15 en https://cr The Field https://www.fieldmuseum.org FMNH Insects insects-O3 FMNH Insects FMNH PreservedSpecimen a9dab59a-4243-45db-b770-9242ffdb6eS2 FMNHINS 0003 008 338 story Field Museum of Occurrence Specimen 527,634 2018-02- 2017-05-04 - 
24 2017-2-15 en https://er The Field https://www.fieldmuseum.org FMNH Insects insects-03 FMNH Insects FMNH PreservedSpecimen 64889596-b40f-49b2-b9e4-7bc50e164671 FMNHINS 0002 821 193 Natural History 27 
25 2017-93-28 en https://cr' The Field https://www.fieldmuseum.org FMNH Insects insects-03 FMNH Insects FMNH PreservedSpecimen 5Sa41ab77-7949-4d63-b085-3a7da80aecS7 FMNHINS 0003 752 846 : : 
2017-2-15'en https://cri The Field https://www.fieldmuseum.orgFMNH Insects _insects-O3FMNH _ Insects FMNH PreservedSpecimen_a081d233-5635-4958-b4be-5d348549ecal  FMNHINS 0002820 049 story Field Museum of Occurrence Specimen 285,342 2018-01- 2017-02-06 - 
2017-2-15' en https://cr' The Field https://www.fieldmuseum.orgFMNH insects insects-O3FMNH _ Insects FMNH 1$4aa2556-C198-478a-b088-47c188d67371 FMNHINS 0002858 089 ‘eptile Natural Histo 23 
28 2017-7-ST.en https://er The Field https://www.fieldmuseum.orgFMNH Insects insects-O3FMNH _Insects FMNH PreservedSpecimen 276a395e-fdb4-4c57-b5f5-b81a97944469 FMNHINS 0003 710 034 epule MY Descri ption 
29 2017-2-15 en https://cr The Field https://www.fieldmuseum.org FMNH Insects insects-03 FMNH Insects FMNH PreservedSpecimen 12c7e243-8170-4541-b028-ff0fb30a5d0d FMNHINS 0002 821 793 
30 2017-10-21en https://cri The Field https://www.fieldmuseum.orgFMNH Insects —_insects-O3FMNH _Insects FMNH PreservedSpecimen 60862a79-e9f5-4f3a-be4d-734890e9680f FMNHINS 0003 765 873 " The Division of Insects’ holdings of worldwide Arthropoda (excluding Crustacea) rank fifth in overall size 
31 2017-3-24 en https://cr'The Field https://www.fieldmuseum.orgFMNH Insects _insects-03FMNH _ Insects FMNH PreservedSpecimen 192ec640-25f2-4da9-9944-3cbbc25ba7d4 __ FMNHINS 0003 623 699 story Field Museum of Occurrence Specimen 22,851 2018-01- 2017-01-27 - among North American collections and are of worldwide importance for many groups. The collection 
2 2015-12-1:en https://er The Field https://www.fieldmuseum.orgFMNH Insects insects-O3FMNH _ Insects FMNH PreservedSpecimen ff36a108-69e6-43e2-becc-8d11dabd3726 FMHD#1981-0014 ‘ H 3 é é 
a pi 5 » 
2 lection Natural History 2 resently includes roughly 4.1 million pinned insects plus 8.3 million specimens or lots in alcohol or on 
33 2017-2-15 en https://cri The Field https://www.fieldmuseum.org FMNH Insects insects-03 FMNH Insects FMNH PreservedSpecimen 838558dc-9ff2-4739-bdf5-c3af215eeac4 FMNHINS 0000 116 488 microscope slides. In addition, there are over 17,000 partly-sorted “bulk samples” from traps or leaf-litter 
34 2017-2-15 en https://criThe Field https://www.fieldmuseum.orgFMNH Insects _insects-03FMNH _ Insects FMNH PreservedSpecimen 06b9d49b-f162-4b62-8009-049ee1316ccf FMNHINS 0002 828 382 ¥ - seu ; f 06 = : —— sh eeelabaat y ‘ 
ia 4 > | occurrence  €3 0K) m ae ra story Field Mu ‘ m of Occurrence Specimen 62,149 2018-01 2017-01 extractions. The collection receives heavy use by US and international research visitors and borrowers 
Beaty.| 100% O——V= tes Natural History 23 as well as extensive educational use. 
Collection 
Field Museum of Natural History Field Museum of Occurrence Specimen 115,932 2018-03- 2016-12-19 - Geographic coverages 
(Botany) Bryophyte Collection Natural History 05 ae 
joba’ 
Showing 1 to 14 of 14 @ previous next B 


The most recently updated resources are also available as an RSS feed. §}. Additional info 


https://www.fieldmuseum.org/field-museum-natural-history-conditions-and-suggested-norms-use- 
collections-data-and-images 
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174 installations 
52 countries 
211 checklists 
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2,051 occurrence datasets 
55 sampling event datasets 
100+ million records 


BENEFITS OF OPENESS 


Increases the efficiency of research 

Promotes scholarly rigor and quality of research 

Enables tracking of data use and data citation through DOIs 
Expands the spectrum of academic products through data papers 
Enhances visibility and scope for engagement 

Enables researchers to ask new research questions 

Enhances collaboration and community-building 

Increases the economic and social impact of research 


International conventions and requirements from funding agencies 


Piwowar et al. 


(2007) 
Content CC-BY-2.0 


OPEN a ACCESS Freely available online 


® pros one 


Sharing Detailed Research Data Is Associated with 


Increased Citation Rate 


Heather A. Piwowar*, Roger 5. Day, Douglas B. Fridsma 


Department of Biomedical Informatics, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, United States of America 


Background. Sharing research data provides benefit to the general scientific community, but the benefit is less obvious for 
the investigator who makes his or her data available. Principal Findings. We examined the citation history of 85 cancer 
microarray clinical trial publications with respect to the availability of their data. The 48% of trials with publicly available 
microarray data received 85% of the aggregate citations. Publicly available data was significantly (p= 0.006) associated with 
a 69% increase in citations, independently of journal impact factor, date of publication, and author country of origin using 
linear regression. Significance. This correlation between publicly available data and increased literature impact may further 


motivate investigators to share their detailed research data. 


Citation: Piwowar HA, Day RS, Fridsma DB (2007) Sharing Detailed Research Data ls Associated with Increased Citation Rate. PLoS ONE 2(3): e308. 


doi:1 0.1371 /journal.pone.0000308 


INTRODUCTION 
research data-sample attributes, clinical factors, patient outcomes, 
DNA i ii raw mRNA — measurements—with 


“sont their aad analysis[ 1] . in addin to rene aa to 
confirm original results, raw data can be used to explore related or 
new hypotheses, particularly when combined with other publicly 
available data sets. Real data is indispensable when Investigating 
and developing study methods, analysis techniques, and software 
implementations. he larger scientific community also benefits: 
sharing data encourages multiple perspectives, helps to identify 
errors, discourages fraud, is useful for taming new researchers, 
and increases efficient use of funding and patient population 

Believing that that these benefits outweigh the costs of sharing 
research data, many initiatives actively encourage Investigators to 


make their data available. Some journals, includmg the PLoS 


family, require the submission of detailed biomedical data to 
publicly available databases as a condinon of publication|?—4]. 
Since 2003, the NIH has required a data sharing plan for all large 
funding grants. he growing open-access publishing movement 
will perhaps increase peer pressure to share data. 

However, while the general research community benefits from 
shared data, much of the burden for sharing the data falls to the study 
investigator. Are there benefits for the 1 investigators themselves? 

A currency of value to many investigators is the number of times 
their publications are cited. Although limited as a proxy for the 
scientific contribution of a paper|5], citation counts are often used 
in research funding and promotion decisions and have even been 


ed =a. alarikcrerweue mlloawr araleee el) Ebates rade ic 


RESULTS 

We stucied the citations of 85 cancer microarray clinical trials 
published between January 1999 and April 2003, as identified in 
a systematic review by Ntzani and loannidis[7| and hsted in 
Supplementary ‘Text 51. We found 41 of the 85 clinical trials 
(48%) made their microarray data publicly available on the 
internet. Most data sets were located on lab websites (28), with 
a few found on publisher websites (4), or within public databases (6 
in the Stanford Microarray Database (SMD)[8], 6 m Gene 
Expression Omnibus (GEQO)/9|, 2 in ArrayExpress|10], 2 in the 
NCI GeneExpression Data Portal (GED P\gedp.nci nih gov); some 
datasets in more than one location). ‘The internet locations of the 
datasets are listed in Supplementary Text 52. ‘The majority of 
datasets were made available concurrently with the tral 
publication, as illustrated withm the WayBackMachine internet 
archives (www.archive.org/web/web.php) for 25 of the datasets 
and mention of supplementary data within the trial publication 
iselffor 10 of the remaiming 16 datasets. As seen in ‘Table 1, trials 
published in high impact journals, prior to 2001, or with US 
authors were more likely to share their data. 

‘The cohort of 85 trials was cited an aggregate of 6239 times in 
2004-2005 by 3133 distinct articles (mecian of 1.0 cohort citation 
per article, range 1—23). The 48% of trials which shared ther data 
received a total of 5334 citations (85% of aggregate), distributed as 
shown in Figure 1. 


Academic Editor: John loannidis, University of loannina School of Medicine, 
Greece 


Received December 13, 2006; Accepted February 26, 2007; Published March 71, 
2007 


ata access 


© GBIF.org: search, browse a 
download 
¢ Occurrences 
¢ Species 
¢ Datasets 
¢ Publishers 


¢ Countries 


GBIF API - machine friendly e- 
access 


¢ External systems 





DATA ACCESS —- FREE AND OPEN TO ALL 


inside GOIF 





14 OCTOBER 2016 


Citation guidelines 


These guigcitncs Sroside the most coranmon exannbies of sttation by CBIF sscrs. 


The practice of citation serves two primary purposes: to acknowledge the original source of Information 
anc to help other researchers find that source. As an open data research infrastructure, GBIF 
ancourages good ci 
reproducibiltty. 





Mian practices lo ensure proper credit and attribution as well as transparency and 





Below you'll find gukielines far the most common cases af citation by GRIF users. While these are 
presented In Harvard style, please feel tree to adapt citations to the style format required by your 
mstitution, publisher or agency. However, please do include each element of content from the relevant 
example, especially the DOI fink, URL and date. 





Citation examples 


Occurrence data download through GBIF.org 





Whang. cerieiaces.s ger downloads data from GBIF or, s/he is redirected to a vane that includes the 


When using this dataset please use the following citation: 
GBIF org (29th February 2016) GRIF Occurrence Download http/dolorg/10.15468/dl.ywhpme 


This citation also appears In 2 confirmation sent to the emall account that the user registered with. 


By using the assigned DOIs tcluded with your citations, you vastly improve GBRIMs ability to track the 
use of dats, which we can then report to data publisher. ft also provides the mechanism for connecting 
published uses of the data back to its sources. In addition to acknowledging ther, the practice of using 
DO! citations rewards publishers by reinforcing the value of sharing open data to the publisher's 
stakeholders and funders. 


Individual checklist, occurrence or sampling event dataset 


Fach dalaset page cantains a paragraph that provides a Gefault citation, for ¢ 







Get data Share Tools inside GBIF Q 





DEVELOPER APIDOCS 


API Summary 


bttp:/ / apt. ghif.org/ v1 / 


SUMMARY REGISTRY SPECIES OCCURRENCE MAPS NEWS 


The GBIF API is a RESTful JSON based API. The base URL for v1 you should use is: http://api.gbif.org/v1/ 


The API should be considered stable, as should this accompanying documentation. Please report any issues you find with either the API itself or the documentation using the “feedback” 
button on the top right 


Content 


Sections 


API Sections 
The API is split into logical sections to ease understanding 


Communication Registry: Provides means to create, edit, update and search for information about the datasets, organizations (¢.9 


data publishers), networks and the means to access them (technical endpoints). The registered content controls 


Common operations 
what is crawled and indexed in the GBIF data portal, but as a shared AP! may also be used for other initiatives 


Authentication Species: Provides services to discover and access information about species and higher taxa, and utility services 
for interpreting names and looking up the identifiers and complete scientific names used for species in the GBIF 
Enumerations portal. 
rence: Provides access to occurrence information crawled and indexed by GBIF and search services to do 





Roadmap to v2 
P real time paged search and asynchronous download services to do large batch downloads 


Maps: Provides simple services to show the maps of GBIF mobilized content on other sites. 
Various 


BARES, “AT IO tenn tenn infneenntinn mun am nannen meihlinhad sninn ADIT emhilinad anntant fre 


Communication 


You can sign up to the GBIF AP! users mailing list to post your questions, and to keep informed about the API. We will 
announce new versions, and scheduled maintenance downtimes before they happen 
Content . = iste 
Sections We welcome any example uses of the API to guest feature on the GBIF developer blog 


Feedback from developers on the API can be sent to informatics 





Communication 


Common operations Common operations 
Authentication The following details common cross-cutting parameters used in the API 
Enumerations Paging 


Roadmap to v2 For requests that support paging the following parameters are used 


How does it work (for you)? 
Seven steps of publishing data through GBIF: 





A GBIF 
. Agree with you administration 

. Register your institution / collection 

. Understand what data type you are dealing with (4 types) 

. Standardize / format your data (export) 

. Choose your way to publish 

. Check your data 

. Publish! 


DATA PUBLISHING 


NO UhWN FE 


GBIF member nodes endorse institutions to share data 
¢ Optionally enforcing data quality thresholds 
¢ Judgment on a case by case basis 
Institutions publish datasets on GBIF 
¢ Retain ownership 
¢ Referenced as the “publisher” 
Institutions often offer assistance to others in separate agreements 
¢ Hosting services 
Chose your data license 
¢ CCO and CC BY - 90% datasets, 85% records 
¢ CC BY-NC - 7% of datasets, 10% of records 





A¢ GBIF 


THANK YOU 


Global Biodiversity Information 
Facility (GBIF) 
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RAEVLING, EMPETRUM NIGRUM. 


Image: Crowberry (Empetrum nigrum). Billeder af nordens flora. v.1 Kabenhavn,G.E.C. 


Gad's forlag,1917-1927. biodiversitylibrary.org/page/10459683 


241] 


Discussion 


Discussion 


What are the barriers to open sharing of data trom 
your areas? 


How do you think these barriers can be overcome? 


A GBIF 





